GenePC and ASPIC Integrate Gene Predictions with Expressed Sequence Alignments To Predict Alternative Transcripts

نویسندگان

  • Tyler S. Alioto
  • Roderic Guigó
  • Ernesto Picardi
  • Graziano Pesole
چکیده

We have developed a generic framework for combining introns from genomicly aligned expressed–sequence–tag clusters with a set of exon predictions to produce alternative transcript predictions. Our current implementation uses ASPIC to generate alternative transcripts from EST mappings. Introns from ASPIC and a set of gene predictions from many diverse gene prediction programs are given to the gene prediction combiner GenePC which then generates alternative consensus splice forms. We evaluated our method on the ENCODE regions of the human genome. In general we see a marked improvement in transcript-level sensitivity due to the fact that more than one transcript per gene may now be predicted. GenePC, which alone is highly specific at the transcript level, balances the lower specificity of ASPIC.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

ASPic-GeneID: A Lightweight Pipeline for Gene Prediction and Alternative Isoforms Detection

New genomes are being sequenced at an increasingly rapid rate, far outpacing the rate at which manual gene annotation can be performed. Automated genome annotation is thus necessitated by this growth in genome projects; however, full-fledged annotation systems are usually home-grown and customized to a particular genome. There is thus a renewed need for accurate ab initio gene prediction method...

متن کامل

Leveraging EST Evidence to Automatically Predict Alternatively Spliced Genes, Master's Thesis, December 2006

Current methods for high-throughput automatic annotation of newly sequenced genomes are largely limited to tools which predict only one transcript per gene locus. Evidence suggests that 20-50% of genes in higher eukariotic organisms are alternatively spliced. This leaves the remainder of the transcripts to be annotated by hand, an expensive time-consuming process. Genomes are being sequenced at...

متن کامل

P-121: Cloning and Expression of The Inosine Triphosphate Pyrophosphatase Gene Variant II in E.coli

Background Environmental and cellular inappropriate conditions can cause damages to cells nucleotide poll. Deamination and oxidation damages interfere with cell�s vital reactions. Inosine triphosphate pyrophosphatase (ITPA), an evolutionary conserved enzyme, plays a critical role in elimination of non-canonical bases. In human genome, the ITPA gene is located on chromosome 20 short arm and tran...

متن کامل

Using native and syntenically mapped cDNA alignments to improve de novo gene finding

MOTIVATION Computational annotation of protein coding genes in genomic DNA is a widely used and essential tool for analyzing newly sequenced genomes. However, current methods suffer from inaccuracy and do poorly with certain types of genes. Including additional sources of evidence of the existence and structure of genes can improve the quality of gene predictions. For many eukaryotic genomes, e...

متن کامل

ASPicDB: A database resource for alternative splicing analysis

Motivation: Alternative splicing has recently emerged as a key mechanism responsible for the expansion of transcriptome and proteome complexity in human and other organisms. Although several online resources devoted to alternative splicing analysis are available they may suffer from limitations related both to the computational methodologies adopted and to the extent of the annotations they pro...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2008